Continuous transition of social efficiencies in the stochastic strategy minority game 
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We show that in a variant of the minority game problem, the agents can reach a state of maximum 
social efficiency, where the fluctuation between the two choices is minimum, by following a simple 
stochastic strategy. By imagining a social scenario where the agents can only guess about the 
number of excess people in the majority, we show that as long as the guessed value is sufficiently 
close to the reality, the system can reach a state of full efficiency or minimum fluctuation. A 
continuous transition to less efficient condition is observed when the guessed value becomes worse. 
Hence, people can optimize their guess for excess population to optimize the period of being in 
the majority state. We also consider the situation where a finite fraction of agents always decide 
completely randomly (random trader) as opposed to the rest of the population who follow a certain 
strategy (chartist). For a single random trader the system becomes fully efficient with majority- 
minority crossover occurring every 2 days on average. For just two random traders, all the agents 
have equal gain with arbitrarily small fluctuations. 



I. INTRODUCTION 

The minority game (MG) [H-Ill is a game of repeated 
choice of N (odd) non-communicating agents where the 
agents ending up in the minority receive positive pay- 
offs. This is a variant of the El Parol bar problem Q. 
Existence of a phase transition has already been reported 
in the MG pj. 

One of the aims of the study of MG is to bring the pop- 
ulations in either choices sufficiently close to {N — l)/2 
and in doing so to keep the time of convergence and fluc- 
tuations low ^3|, ti] . As is readily seen, a random choice 
would make the time of convergence virtually zero, but 
the fluctuation would be of ^/N order. This makes the 
system socially very inefficient. This is because, although 
Nl2 agents could be in the minority at the same time, 
due to fluctuation this number may be much smaller than 
that, thereby making the system socially inefficient due 
to poor resource utilization. To increase social efficiency, 
this fluctuation has to be minimised. Several adaptive 
strategies were studied to improve upon this situation. 
However, regarding the fluctuation, even for the most 
complex strategies only the pre-factor could be made 
smaller and the fluctuation still scales with \/N . A large 
misuse of resource is therefore likely in this situation. 

There have been some recent attempts to study a 
generalization of the MG problem in the Kolkata paise 
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restaurant (KPR) problem, where there are N agents and 
TV' choices (restaurants). Keeping a finite comfort level 
(equal in most cases) for the restaurants (say, one agent 
per choice) one arrives at similar problem of finding the 
most efficient strategy (non-dictated) for resource alloca- 
tion (B-Q- It was shown that a simple crowd-avoiding, 
stochastic, non-dictated strategy led to the most effi- 
cient resource allocation for the problem in a very short 
convergence time (practically independent of the system 
size) 

Stochastic strategies have been used also for the MG 
problem IQ]. Recently, Dhar et al. [ll| used the KPR 
strategy in the MG problem to find that the fluctuation 
could be made arbitrarily small in a convergence time of 
the order log log A^. This, of course, is the best possible 
strategy so far in terms of resource allocation in the MG. 
However, this situation differs a bit from the classic MG 
problem. The most important difference is that, being a 
crowd- avoiding strategy, it requires that the agents know 
not only whether they were in the minority or major- 
ity the previous evening, but also the number of excess 
people in the majority if he or she were there. 

The present study intends to determine if knowing the 
number of excess is indeed necessary in reaching a state 
with minimum fluctuations. We show that even in a more 
realistic situation where exact knowledge of the number 
of excess people is not available to the agents, it is pos- 
sible to reach a state with minimum fluctuations. A nat- 
ural relaxation of the complete knowledge of the excess 
crowd would be to make a guess about the actual value. 
We show analytically for an idealized situation, where 
all agents make the same guess, that if the guess value 
is smaller than twice the real value, the minimum fluc- 
tuation state (or the absorbing state) can be reached. 
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However, for any larger error, a residual fluctuation pro- 
portional to the excess error made, stays in the system 
in the steady state. Further, in a generalized and more 
realistic version, where the guesses differ and are ran- 
dom for each agent in each step (annealed disorder), we 
analytically and numerically find similar behavior in the 
fluctuations in the system only in terms of the average 
guess value this time. We show that depending upon the 
guessing power of the agents, there is a continuous phase 
transition between full (minimum fluctuation or absorb- 
ing phase) and partial (system with residual fluctuation 
or active phase) efficiencies for resource allocation in this 
problem. We also analyze the case where the knowledge 
of the excess population is completely unknown to the 
agents. It is shown both analytically and numerically 
that zero fluctuation can be reached by following an an- 
nealing schedule. To further incorporate the effects of 
real situations, we consider the fraction of agents who 
decide completely randomly. When the number of ran- 
dom traders is 1, although the fluctuation can be at its 
minimum, he or she is always in the majority. However, 
for more than one random traders, this situation can be 
avoided, maintaining a minimal fluctuation. 

The rest of the paper is organized as follows: In Sec.llll 
we discuss different strategies followed by the agents to 
minimize the fluctuation. We show that by tuning a 
suitable parameter, an active- absorbing phase transition 
takes place. We elaborate on other strategies where the 
agent's knowledge of the excess population is only par- 
tial or is completely absent. In Sec. HI we discuss the 
effects of random traders. Finally, in Sec. IV, we give 
concluding remarks. 



II. STRATEGY OF THE AGENTS 

In the strategy followed in Ref. [llj, the agents in the 
majority shift with the probability 



P+ 



A{t) 



M + A{t) + r 



(1) 



and the agents in the minority remain with their choice 
{p- = 0), where the total population {N = 2M + 1) 
is divided as M -I- A(i) -t- 1 and M - A(t) with A{t) = 
{\NA{t) - Neit)] - l)/2, where 7Va(<) and Nsit) are the 
populations in the two choices at time t. In this strategy, 
the agents can reach the zero fluctuation limit in log log N 
time Although the resource utiliszation is maximum 
in that case, its distribution is highly asymmetric in the 
sense that after the dynamics stops in the A{t) — limit, 
the agents in the minority (majority) stay there for ever; 
hence, only one group always beneflts. Apart from this, 
in this strategy the knowledge of A{t) is made available 
to all the agents, which is not a general practice in MG. 

In the following subsections, we introduce several vari- 
ants of the above mentioned strategy. Primarily we in- 
tend to find if it is possible to avoid the freezing of the 



dynamics while keeping the fluctuation as low as possi- 
ble. We then discuss if it is possible to achieve such states 
without knowing the magnitude of A{t). 



Uniform approximation in guessing the excess 
crowd 



Let us assume that the agents know the value of A{t). 
We then intend to find a strategy where the dynamics of 
the game does not stop and the fluctuation can be made 
as small as required. 

To do that we propose the following strategy: The 
shifting probability of the agents in majority is 



A'(t) 



M + A'{t) + 1' 



(2) 



[where A'{t) — gA{t) and 5 is a constant] and that from 
the minority remains zero. We will see that a steady 
state can be reached in this model where the fluctuation 
can be arbitrarily small. 



1. Steady-state behavior 

Now, to understand when such a steady state value is 
possible, note that when the transfer of the crowd from 
majority to minority is twice the difference of the crowd, 
the minority then will become the majority and will have 
the same amount of excess people as in the initial state. 
Quantitatively, if the initial populations were M + A and 
M — A roughly, and if 2A is shifted from majority to mi- 
nority, then the situation would continue to repeat itself, 
as the transfer probability solely depends on the excess 
crowd. Of course, this is possible only when g > 1. More 
formally, if the steady-state value of A(t) is A,, then the 
steady state condition would require 



(M + A, + 1) 



= 2A.,. 



M + gAs + l 
This, on simplification yields either As = or 

A, = ^ Af + 1 . 

g 



(3) 



(4) 



Clearly, for g < 2{= gc), Ag — would be the valid solu- 
tion, since the above equation predicts a negative value 
for As, which indicated no steady-state saturation until 
it decreases to zero. Therefore, one can predict a phase 
transition of the active- absorbing type [l2| by tuning the 
value of g. When < g < 2, the system will reach the 
minimum fluctuation state where A(t) = and the dy- 
namics stops (the dynamics will differ qualitatively for 
g < 1 and g > 1; see the appendix). For g > 2, how- 
ever, a residual fluctuation will remain in the system, 
keeping it active. Physically, this would mean that until 
the guessed value of the crowd is not too incorrect (twice 
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FIG. 1: Steady state values of the order parameter Os for 
different values of g and x. The solid lines shows the analyt- 
ical results for the pure and annealed disordered cases. Both 
match very well with the simulation points. Inset shows the 
log-log plot near the critical point for the disordered case, 
confirming — 1.00 ±0.01. All simulation data are shown for 
M = 10^ 



as large), the agents can still find the minimum fluctua- 
tion state. However, when the guess becomes too wild, a 
fluctuation remains in the system. 

Therefore, it is now possible to define an order param- 
eter for the problem as 0{t) = A{t) /M and its saturation 
values behaves as = when g < 2 and when M ^ 1, 
for g > 2, with Os = {g—gc)/ 9 giving the order parameter 
exponent /3 = 1 for this continuous transition. In Fig. [1] 
we plot the results of numerical simulation [M — 10^) as 
well as the analytical expression for the order parameter. 
We find satisfactory agreement. 



2. Dynamics of the system 

In this simple situation, it is possible to calculate the 
time dependent behavior of the order parameter both at 
and above the critical point. Suppose, at a given instant 
i, the populations in the two choices A and B are NA{t) 
and NB{t) respectively with NA{t) > Nsit). By defini- 
tion 



Mt) 



NA{t) ~ Neit) - 1 



(5) 



Moreover, the amount of the population to be shifted 
from A to B using this strategy would be 



Sit) 



M + gA{t) 



-{M + A{t) + 1) 



(6) 



when A{t) is small compared to M, i.e., when g is close 
to gc or for large time if g < gc- 

Clearly, 7V^(t -I- 1) = A^a(0 - S{t) and Neit + 1) = 
Nsit) + S{t), giving (assuming population inversion; see 



the appendix for a general treatment) 

Neit + I) - NAit + 1) - 1 



A{t + 1) 



gA{t) - A{t) - 1. 



(7) 



Therefore, the time evolution of the order parameter 
reads 



dOjt) 
dt 



-(2-5)0(t) 



1 

M' 



Neglecting the last term and integrating, one gets 
0(0 =O(0)exp[-(2-g)t]. 



(8) 



(9) 



The above equation signifies an exponential decay of the 
order parameter in the subcritical region (1 < 17 < 2). 
It also specifies a time scale as r ~ (gc — g)~^ diverging 
as the critical point is approached. These behaviors are 
confirmed by numerical simulations. 

In Eq. ([6]), the approximation was made to keep the 
leading order term only. If, however, the first correction 
term is kept, the expression becomes 

Sit) « gA{t) ~ ^{g'A'it) ~ gA'it)). (10) 

The time evolution equation of the order parameter then 
becomes 

^ = -{2- g)0{t) - g{g - l)0^t) - j^. (11) 

Now, if we consider the dynamics exactly at the critical 
point, i.e., g = 2, then the first term in the right-hand- 
side is zero. The last term can be neglected. Therefore, 
the order parameter becomes 



0{t) = 



0(0) 



2O(0)t + 1' 



(12) 



In the long time limit 0{t) ^ t^^, giving ^ = 1. 

Therefore we see that under this simple approxima- 
tion, the usual mean field active-absorbing transition ex- 
ponents are obtained. These are confirmed using numer- 
ical simulations as well. A general solution of the dy- 
namics (valid for all g values at all times) is shown in the 
Appendix, which in the limiting cases yield the results 
mentioned above. 



B. Nonuniform guessing of the excess crowd 

In the above mentioned strategy one can find a steady 
state for any value of the fluctuation. However, unlike 
the common practice in MG, the value of A{t) is exactly 
known to all the agents. Here we consider the case 
where each agent can only make a guess about the value 
of A{t). Therefore for the i-th agent 



A,{t)^A{t){l + e,), 



(13) 
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FIG. 2: Data collapse for finding v in the disordered case for 
different x values. The estimate vs, v = 1.00 ± 0.01. Inset 
shows the uncoUapsed data. The straight line at the critical 
point gives (5 = 1.00 ± 0.01. Simulation data is shown for 
M = lO''. 




FIG. 3: Data collapse for finding z in the disordered case 
for different system sizes (M = 10^, 10*, 10^, 10®) at a: = 1.0. 
The estimate is zjd = 0.50±0.01. Inset shows the uncoUapsed 
data. The linear part in the inset confirms 5 = 1.00 ± 0.01. 



where is a uniformly distributed random number in the 
range [0 : 1t\ and is an annealed variable (i.e., changes 
at each time step randomly) with \x\ > 0. Clearly, 

(A,(i))=A(0(l + (e,)) = A(0(l+x), (14) 

2x 

where (Q) {— l/2x J Qde) denotes average of Q over 



randomness. With the analogy of the previous case, we 
expect a transition from zero to finite activity near Xc — 
1. In Fig. [1] we plot the steady state values of the order 
parameter against (a; + 1). Clearly, the active- absorbing 
transition takes place at = 1 (note that this is the 
same point ffc = 2 where the transition for the pure case 
took place). 

Note that irrespective of whether population inversions 



with 



S{t) 



A{t + l) = \A{t)-S{t)\ 
Am + e) 



M + A(i)(l + e) 



This leads to 



Oit + l) = Oit) 



l + (l + e)0(t) 



(15) 
(16) 

(17) 



First consider the steady state where 0(t+l) = 0{t) = 
O* (above the critical point). After simplification, the 
above equation reduces to 



(1 - 0*)2xO* 
(1 + 0*) 



In 



1 



2xO* 
l + O* 



(18) 



One can numerically compare the solution of this equa- 
tion with the simulations, which agrees well (see Fig. [ij. 
A small O* expansion of the above equation yields O* ^ 
{x - 1), giving /3 = 1. 

Also, for small 0{t) the dynamical equation would 
yield (at or above the critical point) 



dOjt) 
dt 



{x-l)0{t)-xO^{t). 



(19) 



Now, by neglecting the square term (in presence of the 
linear term above the critical point) one would obtain 
v = 1 and by keeping the square term (in absence of the 
linear term at the critical point) one would obtain 6 = 1. 
Thus all the exponents of the pure case are recovered for 
annealed disorder. 

One can numerically verify the above exponent values 
using the following scaling form of the order parameter 
(writing x + 1 = g) 



0{t)^t-'F{t^/''{g~g,),t'"^/N 



(20) 



where d is the space dimension, which we take to be 4 
in this mean field limit. At the critical point, the order 
parameter follows a power-law relaxation 0{t) ~ (see 
inset of Fig. [2] ) with 5 = 1.00 ± 0.01. In Fig. [2] we plot 
0{t)t^ against t{g — gcY . By knowing 5, v can be tuned 
to get data collapse. The estimate of v is 1.00 ± 0.01. 
Similarly, in Fig. Owe plot 0{t)t^ against t/N''/'^. Again 
by tuning z, data collapse can be found. The estimate 
for z/d comes out to be 0.50 ± 0.01. Thus the analytical 
estimates are verified and the scaling relation 5 = is 
satisfied. 

We have also tried the case of quenched disorder (e's 
are fixed for the agents for all time). Above the criti- 
cal point, when population inversion occurs, this would 
imply that agents with higher e would change side with 
higher probability and are more likely to be in the major- 
ity. A plot of average pay-off for agents having different 
e values verify this statement by showing the monotonic 
decay of the pay-off with increasing e (Fig. 01). 
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FIG. 4: For quenched ei (in Eq. (|13ll ') the average pay-offs of 
the agents are plotted for different e values having different 
ranges as indicated. The monotonic decay with increasing e 
clearly indicates that agents with higher e are more likely to 
be in the majority (see last para of Sec. IIC). 



C. Following an annealing schedule 

In MG, information about the excess crowd is gener- 
ally unknown to the agents. In the strategies mentioned 
above, the excess population is known to the agents ei- 
ther exactly or approximately. Here we consider the case, 
where knowledge of A(t) is not known to the agents. 

The agents follow a simple time evolution function for 
the time variation of the excess population A'^{t). To 
begin with, we consider an annealing schedule 



A^(i) = A^(0)exp(-t/r). 



(21) 



where A-^(O) is taken close to -\/M. In Figs. [S] we plot the 
time variation of the actual value of excess population as 
weh as A^(t). We see that A(<) decreases very quickly. 
Furthermore, it appears that there is a simple relation 
between A^(i) and A{t) such that 



2A{t) = A^{t). 



(22) 



This implies that A{t) ^ 1 in logA^ time. Therefore, 
in this strategy, even if the actual value of the excess 
crowd is not supplied to the agents, they can find a state 
where the fluctuation practically vanishes. We have also 
checked whether this is true for some other functions as 
well. We have taken functional forms such as 



A^{t) = 



A^(0) 
(1 + t)"' 



(23) 



For all cases mentioned above we plot (see Figs. [5]) 2A(t) 
and A'^it) to check if they are equal. We conclude that 
this relation is not dependent on the functional form of 
A'^{t) (as long as it is not too fast; see discussions below). 

The response of the order parameter to the assumed 
trial function can be somewhat understood as follows: 
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FIG. 5: Time variation of the excess population A(t) are 
plotted for different functional forms of A'^{t). Left: In log- 
linear scale the excess population are plotted for exponential 
decay. Right: For power law (A^(0)/(1 + t)" decay, with 
different values of a). M — 5 x 10^ for the simulations. 



The dynamical equation for 0(t) would be 

m-oit)\ 



0{t + l) = 



1 + ry(<) 



(24) 



where ?7(t) = A^{t)/M. Considering the case ry(t) > 0{t) 
(when population inversion takes place) one would arrive 
at 



dO{t) 
dt 



{V{t)-2)0{t)^v{t){l-v{t)) 



(25) 



A general solution of the above equation will be of the 
form 



0{t) 





/(2-,,(ti))dti 
-/(2-»)(ti))dti 



(26) 



where Ci is a constant. This continuum limit is valid 
only for the functions ri{t) that do not decay too fast. 
Considering ri(t) < 2, one can show that the dominant 
term of the solution will be of the form 



0{t) 



yjt) (1 - yjt)) ^ 77(0 
2~r]{t) ^ 2 '■ 



(27) 



as seen numerically. However, evaluation of Eq. (|26p for 

ri{t) ~ r]Q exp{-~t / t) for r > 1/2 gives 



0{t) 



2t- 1 



(28) 



Therefore, 0{t) - ?7(t)/2 (as in Eq. ([22])) is only true 
when T 3> 1/2, which is the measure of slowness required 
in ri{t) to reduce 0{t). 

In the case where 0(0) is large or 77 (i) decays too fast 
(than the limit mentioned above), one would simply have 
(following Eq. ([Ml) ) 



t-i 



0{t) ^ Oit - 1) - r^it - 1) ^ 0(0) - ^(^)- (29) 



/c=0 
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So, 0{t) would in fact saturate to a finite value and no 
population inversion will take place. This is also seen 
numerically. 

III. EFFECT OF RANDOM TRADERS 

According to the strategies mentioned above, if the 
excess population is known to the agents (which in this 
case is in fact a measure of the stock's price) the fluctua- 
tions can have any small value. However, in real markets, 
there are agents who follow certain strategies depending 
on the market signal (chartists) and also some agents may 
decide completely randomly (random traders). Here we 
intend to investigate the effect of having random traders 
in the market, while the rest of the populations follow 
the strategies mentioned above. 

Single random trader 
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FIG. 6: The saturation values of Os are plotted against g for 
different fractions p of the random traders. M = 10^ for the 
simulations. 



When a single random trader is present, even when 
A(i) — 0, that trader would choose randomly irrespective 
of whether he or she is in the minority or majority. This 
will create a crossover between majority and minority 
with an average time of two time steps. In this way, the 
asymmetry in the resource distribution can be avoided. 
However, that single agent will always be in the majority. 



Furthermore, if one incorporates the random traders in 
the strategy described in Sec. IIB, even the knowledge of 
the excess population will not be exactly needed to reach 
a state of very small fluctuations. 



IV. SUMMARY AND DISCUSSIONS 



More than one random traders 

As is discussed in Sec. II, when all agents follow the 
strategy described by Eq. ([T|), after some initial dynam- 
ics, A(i) = implying that they do not change side at all. 
However, with a single random trader, in an average time 
period 2, as he or she selects alternatively between the 
two choices, the rest of the population is divided equally 
between the two choices and it is the random trader who 
creates the majority by always making himself or her- 
self a loser. This situation can be avoided when there 
is more than one random trader. In that case, it is not 
possible always to have them in the majority. There will 
be configurations where some of the random traders can 
be in the minority, making their time period of wining to 
be 2 (due to the symmetry of the two choices) . The ab- 
sorbing state (for g < gc), therefore, never appears with 
random traders, though the fluctuation becomes non-zero 
for more than one random traders. However, if the num- 
ber of random traders (= pN, where p is the fraction 
of random traders) is increased, the fluctuation in the 
excess population will grow eventually to A^^/^(see Fig. 
[S]). Therefore, the most effective strategy could be the 
one in which (i) the fluctuation is minimum and (ii) the 
average time period of gain will be 2 for all the agents, 
irrespective of the fact whether they are random traders 
or chartists. These two are satisfied when the number of 
random traders is 2. 



In the stochastic strategy minority game, a very efh- 
cient strategy is described by Eq. ([T]), where the agents 
very quickly (in log log N time) get divided almost 
equally (M and M + 1) between the two choices. This 
strategy guarantees that a single cheater, who does not 
follow this strategy, will always be a loser [ll|. How- 
ever, the dynamics in the system stops very quickly, mak- 
ing the resource distribution highly asymmetric (peo- 
ple in the majority stays there for ever) thereby mak- 
ing this strategy socially unacceptable. We present here 
modifications in the above mentioned strategy to avoid 
this absorbing state. The presence of a single random 
trader (who picks between the two choices completely 
randomly) will avoid this absorbing state and the asym- 
metric distribution. However, this will always make that 
random trader a loser. But the presence of more than 
one random trader will avoid that situation too, making 
the average time period of switching between majority 
and minority for all the traders (irrespective of whether 
they are chatists or random traders) to be 2. 

We show (in Sec. [Hi that by varying a parameter, the 
agents can achieve any value of the fluctuation. This 
is an active-absorbing type phase transition and we also 
find the critical exponents analytically (P = 1, v = 1, 
S ~ I), which are well supported by numerical simula- 
tions. Then we go on to reduce the knowledge of the 
agents about the excess populations, which was exactly 
known to the agents in the earlier strategies. We assume 
that the agents can only make a guess about the excess 
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population. We show using numerical simulations and 
also using approximate analytical calculations that when 
value of the average guess of the agents are not too bad 
(less than twice the actual value) they can still reach the 
state of zero fluctuation. Once again the fluctuation val- 
ues increase continuously, when the guess becomes worse. 
This is again an active- absorbing type phase transition 
with similar values for critical indices. Next we consider 
the case when the knowledge of excess crowd is com- 
pletely absent for the agents. In this case the agents as- 
sume a time variation (annealing schedule, see Sec. IIC) 
for the excess population and they do not look at the 
actual value. It is shown for several choices of the func- 
tional form that the actual value of the excess population 
essentially follows the assumed form and thereby goes to 
zero in finite time (depending on the assumed functional 
form) . Again we show both analytically and numerically 
that for slow enough annealing schedule the attainment 
of zero fluctuation in log N time can be guaranteed. Fi- 
nally, as mentioned before, we also consider the effect of 
having random traders in the market who decide abso- 
lutely randomly in Sec. III. 

We have presented several stochastic strategies for Mi- 
nority Game. We have shown that utilizing the existence 
of a continuous transition (from Ag = to A.; 7^ 0) 
the agents can find the state of zero or arbitrarily small 
amount of fluctuation (thereby making the system so- 
cially efficient) in logA'^ time with or without knowing 
the excess population. Presence or evolution of a few 
(minimum 2) random traders in the population will not 
only help the entire population to get out of the absorbing 
state (for g < Qc), but also will have a minimal fluctua- 
tion or maximum social efficiency. 

Stated briefly, as a population of N agents try to evolve 
a strategy such that each of them individually belong to 
minority among the two choices A and B, N — 2 of them 
will follow the stochastic strategy given by Eq. ([T]) , while 
the rest 2 will follow a purely random strategy. This will 
ensure, as shown here, that fluctuation Ag (~ \Na — Nb\) 
will have arbitrarily small value (giving maximum social 
efficiency) achieved in log N time and the absorbing state 
will never appear while everyone will have an average pe- 
riod of 2 in the minority/majority. Decrease in the num- 
ber of random traders will either enforce the absorbing 
state or indefinite stay in the majority for the random 
trader, while increase in the number beyond 2 will in- 
crease As, eventually converging to its Vn value. As 
shown, even precise value of A(t) in Eq. ([!]) in unneces- 
sary and appropriate guesses (see Sec. IIB) or appropri- 
ate annealing of A{t) (see Sec. IIC) can achieve almost 
the same results. 
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Appendix A: General solution of the dynamics 

In the calculations leading to Eqs. (|H]) and (|lll) we 
have assumed that the two choices alternatively becomes 
minority and majority (population inversion happens). 
That of course is our concern while studying the active 
phase {g > gc)- However, as is clearly seen from the 
strategy, this population inversion only happens when 
g > 1. So, a more general solution of the dynamics valid 
for all g can be done as follows: Consider the auxiliary 
variable u{t) = \g - l\VO{t). Putting this in Eq. ^ 
and neglecting 1/M term, one arrives at the recursion 
relation 



u{t + 1) = u{t) + g\g ~ 1\K 



Clearly, 



u{t) = u(0) + 



.g(l-|g~l|*) 



which leads to 

1-1.9-11 



0{t) 



(Al) 



(A2) 



(A3) 



For the special case oi g = 2 one gets back Eq. 
Also, one can always define the time scale for the above 
equation as 



1 



\H\9-m' 



(A4) 



Near the critical point 5 — > = 2, the already obtained 
power-law divergence ((17 — gc)~^) is recovered. 

Finally, for g < 1, the dynamical equation will reduce 

to 



(A5) 
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